Advanced training methods and new network topologies for hybrid MMI-connectionist/HMM speech recognition systems

Authors

Christoph Neukirchen

Gerhard Rigoll

Abstract

This paper deals with the construction and optimization of a hybrid speech recognition system that combines a neural vector quantizer (VQ) with discrete HMMs. We integrate VQ-based classification into the continuous classifier framework and derive constraints that must hold for the pdfs in the discrete pattern classifier context. Furthermore, it is shown that for maximum likelihood (ML) training of the whole system, the VQ parameters must be estimated according to the maximum mutual information (MMI) criterion. A novel training method based on gradient search is derived for neural networks (NNs) that serve as optimal VQs. This allows faster training of arbitrary network topologies than traditional MMI-NN training. Integrating multilayer MMI-NNs as the VQ in the hybrid discrete HMM-based speech recognizer leads to a large improvement over other supervised and unsupervised single-layer VQ systems. On the speaker-independent Resource Management database, the constructed hybrid MMI-connectionist/HMM system achieves recognition rates comparable to those of traditional, sophisticated continuous-pdf HMM systems.
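To make the MMI criterion mentioned in the abstract concrete, the following minimal Python sketch estimates the mutual information between class labels and VQ output labels from paired frame-level data. It only evaluates the criterion; it does not reproduce the paper's gradient-based training method. The function name `mutual_information` and the toy label sequences are assumptions made for illustration and do not come from the paper.

```python
import math
from collections import Counter


def mutual_information(class_ids, vq_labels):
    """Estimate I(W; Y) in bits from paired class labels W and VQ labels Y.

    Hypothetical helper for illustration: probabilities are plain
    relative frequencies computed from the paired sequences.
    """
    if len(class_ids) != len(vq_labels) or not class_ids:
        raise ValueError("need two non-empty sequences of equal length")

    n = len(class_ids)
    joint = Counter(zip(class_ids, vq_labels))  # counts of (w, y) pairs
    count_w = Counter(class_ids)                # marginal counts of classes
    count_y = Counter(vq_labels)                # marginal counts of VQ labels

    mi = 0.0
    for (w, y), c in joint.items():
        # p(w, y) * log2( p(w, y) / (p(w) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (count_w[w] * count_y[y]))
    return mi


# A VQ whose labels determine the class carries the full class entropy,
# while a VQ whose labels are independent of the class carries none.
informative = mutual_information([0, 0, 1, 1], [0, 0, 1, 1])
uninformative = mutual_information([0, 0, 1, 1], [0, 1, 0, 1])
```

Under this criterion, a quantizer is preferred when its labels carry more information about the underlying classes, which is why `informative` exceeds `uninformative` in the toy example.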


Similar resources

Performance of hybrid MMI-connectionist/HMM systems on the WSJ speech database

In this paper, a hybrid MMI-connectionist / hidden Markov model (HMM) speech recognition system for the Wall Street Journal (WSJ) database is presented. The HMM part of this system uses discrete probability density functions (pdfs). The neural network (NN) replaces a classical vector quantizer (VQ), such as one built with the k-means or LBG algorithm, which is typically used in discrete HMM systems. The N...


Large vocabulary speech recognition with context dependent MMI-connectionist / HMM systems using the WSJ database

In this paper we present a context-dependent hybrid MMI-connectionist / hidden Markov model (HMM) speech recognition system for the Wall Street Journal (WSJ) database. The hybrid system is built from a neural network, which is used as a vector quantizer (VQ), and an HMM with discrete probability density functions, which has the advantage of faster decoding. The neural network is trained on an...


Efficient computation of MMI neural networks for large vocabulary speech recognition systems

This paper describes how to train Maximum Mutual Information Neural Networks (MMINN) efficiently, using a new topology. Large vocabulary speech recognition systems based on a hybrid MMI/connectionist HMM combination have shown good performance on several tasks [1] and [2]. MMINNs are trained to maximize the mutual information between the index of the winning output neuron (Winner-Take...


Speaker adaptation for hybrid MMI/connectionist speech-recognition systems

In this paper we present a new adaptation technique for our hybrid large vocabulary continuous speech recognition system. In most adaptation approaches the HMM parameters are reestimated. In our approach, however, we train a speaker independent continuous speech recognizer, then we keep the HMM parameters fixed and we train a second network, which transforms the features of the adaptation data ...


Connectionist Viterbi Training: A New Hybrid Method for Continuous Speech Recognition

Hybrid methods which combine hidden Markov models (HMMs) and connectionist techniques take advantage of what are believed to be the strong points of each of the two approaches: the powerful discrimination-based learning of connectionist networks and the time-alignment capability of HMMs. Connectionist Viterbi Trainin...




Publication date: 1997
